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A method for site-specific and high yield modification of tobacco 
mosaic virus coat protein (TMVCP) utilizing a genetic code expand- 
ing technology and copper free cycloaddition reaction has been 
established, and biotin-functionalized virus-like particles were built 
by the self-assembly of the protein monomers. 


Over the past few years, viruses and virus-like particles have 
received increasing attention in various areas of science, such 
as vaccinology’ and nanotechnology” due to their simple mass 
production, tunable shape and size, controllable self-assembly, 
and easy modification. Among these different viruses, the 
tobacco mosaic virus (TMV) has become a popular candidate 
for nanotechnology research due to its many functions. TMV 
serves as a biotemplate for depositing inorganic materials,’ acts 
as the carrier of tumor-associated carbohydrate antigens,’ 
exists as a promising building block for biosensors,’ and, 
as a nanotube, enhances the magnetoviscosity of ferrofluids.® 
TMV is a rod shaped particle consisting of 2130 identical coat 
protein subunits. It has a 300 nm length, an 18 nm outer 
diameter, and a 4 nm inner channel diameter. Moreover, TMV 
capsids can remain stable at temperatures up to 90 °C, and can 
withstand ranges of pH between 3.5 and 9.’ The TMVCP 
monomers have unique self-assembly characteristics and will 
form into disk-like and rod-like nanostructures under different 
conditions with or without RNA.® Moreover, it has been shown 
that the TMVCP can be manipulated chemically and genetically,” 
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which has resulted in many interesting applications, including 
a template for the growth of inorganic materials," a light 
harvesting system,” and a nanoenzyme.'” However, traditional 
techniques used for the modification of TMVCP have many 
drawbacks that limit the usefulness of the TMVCP to a great extent. 
For example, the labelling reactions of genetically introduced 
natural amino acids are not completely site-specific because the 
same amino acids at the other sites could still participate in the 
reactions. Furthermore, the chemical modifications on TMVCP 
typically require at least two steps, which are complex and time 
consuming. Also, the harsh conditions of many chemical reac- 
tions on proteins may lead to low yield and unstable products. 
Therefore, an effective method for site specific modification on 
TMVCP which does not significantly influence its self-assembly 
is needed. 

To solve the problems mentioned above, the amber codon 
technique was utilized. It has already been used to modify some 
viruses and bacteriophages.’* However, the amber codon techni- 
que has not yet been used on TMV, the most widely used and 
well characterized rod-shaped virus in nanotechnology. Using 
this technique, an arbitrarily large array of unnatural amino 
acids can be incorporated site-specifically into proteins by using 
a nonsense codon and orthogonal tRNA-synthetase pairs.’* The 
copper free cycloaddition reactions of azide groups with various 
dibenzylcyclooctyne derivatives are not only highly selective, but 
also have fast reactivity in biological solutions even at ambient 
temperatures.” In this communication, it is demonstrated, for 
the first time, the incorporation of an unnatural amino acid 
containing an azide group into TMVCP using the amber codon 
technique. Under different pH conditions, the mutant proteins 
still maintained their self-assembly capability, and formed into 
disk-like and rod-like structures. Biotin molecules were conju- 
gated on the mutant proteins in a completely site-specific way via 
the one step copper free cycloaddition reaction. These biotin- 
functionalized TMVCPs self-assembled into large disk-like and 
normal rod-like structures under different conditions. This 
approach enables a new way of modifying virus coat proteins 
with synthetic amino acids, and thus provides a simple and 
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Fig. 1 Genetic incorporation of KPN into TMVCP. (A) Structure of TMVCP. 
(B) Structure of the unnatural amino acid KPN. (C) MALDI-TOF mass 
spectra of the TMVCP and KPN11-TMVCP. (D) SDS-PAGE analysis of 
purified KPN11-TMVCP (lane 1, molecular mass marker; lane 2, TMVCP; 
lane 3, KPN11-TMVCP). 
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useful method to achieve the site-specific labelling of the rod- 
shaped virus-like particles. 

In order to express the mutant protein, the unnatural amino 
acid 2-amino-6-(((3-azidopropoxy)carbonyl)amino)hexanoic acid 
(KPN) (Fig. 1C) was synthesized. Once incorporated into the 
protein, the KPN is accessible to cycloaddition protocols for the 
conjugation of the small functional molecules. The Phe11 
(Fig. 1A) was chosen as the substitution site because it is located 
on the external surface of TMVCP, and therefore has a minimal 
impact on the self-assembly. 

An amber stop codon (TAG) was substituted for Phe11 in 
TMVCP by the directed mutation. In order to establish an 
orthogonal tRNA-aminoacyl-tRNA synthetase pair, which selec- 
tively charges the unnatural amino acid of KPN in response to 
the amber codon, three rounds of positive selections and two 
rounds of negative selections were carried out with a MbPylRS 
library.’° A KPN specific PyIRS mutant was identified through 
the screening, which was termed KPNRS. Compared with the 
wild type PylRS, three amino acids were mutated in its protein 
sequence: L274M, C313A and Y349F. Then the corresponding 
DNA sequence of KPNRS was ligated sequentially into SalI/BglII 
and PstI/Ndel restriction sites of the pEVOL vector,” and 
MbtRNAPY',, was cloned into the ApaLI/Xhol restriction site of 
the same vector. As a result, the plasmid of pEVOL-KPNRS- 
MbtRNAP), was generated. For the expression of the mutant 
TMVCP, the two plasmids of pEVOL-KPNRS-tRNAPY}, and 
pET23a-TMVCP-Phe11TAG were co-transformed into the £. coli 
BL 21 competent cell. The expression of the mutant TMVCP was 
induced with the addition of 1 mM KPN, 0.02% t-arabinose and 
0.5 mM IPTG (isopropyl B-p-thiogalactopyranoside) into the LB 
media. The protein cannot be expressed without the addition of 
KPN in the negative control (data not shown). The mutant 
protein was purified according to the procedure described by 
M. Endo et al.‘ and subsequently analyzed by SDS-PAGE 
(Fig. 1D). The results show a single band (lane 3) with the same 
molecular weight of TMVCP (lane 2), indicating the expression 
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and purification of KPN11-TMVCP. To further confirm the 
selective incorporation of KPN and not Phe11, MALDI-TOF 
analysis was performed. The MODLI-TOF mass spectra of wild 
type TMVCP showed a single peak at m/z 17477, and the mass 
spectra of the mutant protein with the substitution KPN for 
Phe11 (Fig. 1B), show a single peak at m/z 17 586 which matches 
the calculated KPN11-TMVCP molecular weight (17 584). These 
results confirm a high fidelity and specificity for the incorpora- 
tion of KPN into the TMVCP. 

To determine whether the KPN11-TMVCP had the same self- 
assembly characteristics as the TMVCP, the monomers were 
dialyzed against different buffers. The self-assembled nano- 
structures of KPN11-TMVCP were examined by transmission 
electron microscopy (TEM). The experiments were repeated 
three times, with reproducible results. The disk-like structure 
(Fig. 2A) was formed upon dialysis against 400 mM Tris-HCl, 
pH 7.6, and had an outer diameter of 18 nm. Once dialyzed 
against 100 mM sodium acetate, pH 5.5, rod-like structures 
(Fig. 2B) were yielded. These results match the values reported 
in the literature’? and indicate that the incorporation of KPN11 
into TMVCP has no influence on the assembly behaviours of 
monomer proteins. 

Dibenzocyclooctyne-PEG,-biotin (Fig. 3A) was used as a 
model molecule in the site-specific modification. It was chosen 
because biotin can specifically bind tightly to streptavidin with a 
dissociation constant (Ka) of 107 ‘* M,” and such a system is one 
of the most widely used affinity pairs in molecular,” immuno- 
logical,” and cellular assays.” The copper free cycloaddition 
reaction between dibenzocyclooctyne-PEG,-biotin and KPN11- 
TMVCP was performed at 4 °C in 400 mM Tris-HCl, pH 7.6 for 
6 h. The reaction product has an observed molecular mass of 
18336 Da (Fig. 3B) from the MALDI-TOF analysis, in close 
agreement with the calculated mass. The normalized intensity 
of the mass spectra of the product KPN11-TMV-biotin is 10 times 
larger than the intensity of the peak at 17586, which is the 
substrate KPN11-TMVCP left over from the reaction, indicating 
that the yield of this reaction was high. 


Fig. 2 TEM images of the self-assembly structures of KPN11-TMVCP. 
(A) Self-assembly of KPN11-TMVCP into disks; (B) self-assembly of KPN11- 
TMVCP into rods. 
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Fig. 3 Site-specific modification and assembly of KPN11-TMVCP. 
(A) Conjugation of dibenzocyclooctyne-PEG,-biotin with KPN11-TMVCP 
by “copper-free” cycloaddition reaction. (B) MALDI-TOF mass spectrum of 
the biotin coupled KPN11-TMVCP. (C) TEM image of the self-assembly 
KPN11-TMVCP-biotin into disks. (D) TEM image of the self-assembly 
KPN11-TMVCP-biotin into rods. 


To assess the self-assembly capacity of KPN11-TMVCP-biotin, 
the same dialysis experiments previously mentioned were per- 
formed. All the experiments were repeated three times with 
reproducible results. The TEM calibration (Fig. S1, ESI+) was 
tested after measurements to ensure consistency. As expected, 
the biotin-functionalized tube structures with an 18 nm diameter 
(Fig. 3D) were formed in 100 mM, pH 5.5 sodium acetate buffer. 
However, when the KPN11-TMVCP-biotin was dialyzed against 
a 400 mM Tris-HCl, pH 7.6 buffer, it formed the biotin- 
functionalized big disk-like structures (Fig. 3C) with about 
30 nm outer diameter and 16 nm inside diameter. A possible 
explanation is that the charges of the micro-environment where 
the biotin derivative occurs might have been changed due to 
the variation of the pH value. This change in the charges would 
perhaps have a substantial impact on the self-assembly of 
KPN11-TMVCP-biotin. As far as we know, this was the first 
time that a big disk-like structure was observed in the TMVCP 
self-assembly. Such a phenomenon might be helpful in the 
research into the TMVCP self-assembly mechanism and chemically 
controlled protein assembly. Moreover, the biotin-functionalized 
rod-like and disk-like nanoparticles forming from the procedure 
mentioned previously have many advantages, such as high-density 
biotin arrays and biocompatibility, which can be further used in 
imaging and therapy. 

We have successfully demonstrated, for the first time, the 
incorporation of an unnatural amino acid into the TMVCP and 
showed its characteristics of self-assembly into rod-like and 
disk-like structures. Moreover, the site-specific labelling with 
high yield was done via a copper free cycloaddition reaction. 
Additionally, the biotin-functionalized virus-like particles were 
built. We have proved that the dibenzocyclooctyne-PEG,-biotin 
modified TMVCP still has the capability of self-assembly. 
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In addition, the big disk-like structures were observed, which 
may provide insight into the self-assembly mechanism and the 
chemically controlled protein assembly. In conclusion, this 
work developed a new method for the site-specific labelling of 
TMVCP, which shows promise for a range of potential applica- 
tions in the field of molecular recognition, biosensing, and 
targeted drug delivery. 
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